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Abstract. In this paper, we shall prove the Chung-Feller Theorem in several ways. We 
provide an inductive proof, bijective proof, and proofs using generating functions, and 
the Cycle Lemma of Dvoretzky and Motzkin |2J. 



1. Introduction 

The main focus of this paper is to prove the following result of Chung and Feller pQ: 

The Chung Feller Theorem. The number of paths from (0,0) to (n,n), with steps 
(0, 1) and (1,0) and exactly 2k steps above the line x = y is independent of k, for every 
k such that < k < n. In fact, it is equal to the nth Catalan number. 

2. Preliminary Definitions and an Inductive Proof 

A Dyck path is a path with steps (0, 1) and (1,0) that starts at the origin and ends 
at (n, n) for some positive integer n. We can study the number of paths indicated in 
the above Chung-Feller theorem in a simpler form if we generalize Dyck paths as seen in 
PJ. A k-negative path of length 2n is a path from (0, 0) to (2n, 0), with steps (1, 1) and 
(1,-1), such that exactly 2k of these steps are below the horizontal axis. 

A prime Dyck path is a Dyck path that returns to the x-axis only once at the end of 
the path. A negative prime Dyck path is a a prime Dyck path reflected about the x-axis. 
In general, any negative Dyck path is the reflection about the x-axis of a Dyck path. 

An Inductive Proof of the Chung-Feller Theorem. As a base case, for n = 0, there is only 
one path with steps. So clearly there is only Co = 1 path with steps below the x-axis. 

Now, let n > and assume that for all % such that < % < n, the number of /-negative 
paths of length 2i is equal to Ci for all I < i. Therefore, choose some k < n. 

Any nonempty ^-negative path either starts out with either a prime Dyck path or a 
negative prime Dyck path. In the first case, the fc-negative path starts with an up-step 
followed by a Dyck path of some length 2p — 2 which is then followed by a down-step and 
a fc-negative path of length 2n — 2p, for some p < n — k. On the other hand, the second 
case deals with paths that start out with a down-step, followed by a (q — l)-negative path 
of length 2q — 2 which is followed by an up-step and a (k — g)-negative path of length 
2n — 2q, for some q < k. 

Let A/" up be the number of ^-negative paths of length 2n that start with an up-step. 
Let A/down, similarly, be the number of k- negative paths of length 2n that start with a 
down-step. Let M be the total number of ^-negative paths of length 2n. 

If a path starts out with an up-step and a Dyck path of length 2p — 2, then the number 
of such paths is the total number of Dyck paths of length 2p — 2 multiplied by the number 
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of ^-negative paths of length 2n — 2p< 2n. By the inductive hypothesis, this means that 
the number iV+ of such paths is C p -iC n - p . For the total number of /c-negative paths that 
start off with an up-step we need to take the sum of iV+, for all 1 < p < n — k. Therefore, 
we have that the total number of A;-negative paths of length 2n that start with an up-step 
is 

n—k n—k 
p=l p=l 

Similarly, we define N~ as the number of £;-negative paths that start out with a down- 
step and a negative Dyck path of length 2q — 2. So N~ is equal to the number of negative 
Dyck paths of length 2q — 2 multiplied by the number of (k — g)-negative paths of length 
2n — 2q < 2n. Obviously, since the negative Dyck paths are reflections of (positive) 
Dyck paths, the number of negative Dyck paths of length 2q — 2 is equal to the number 
of Dyck paths of length 2q — 2. Therefore, we again have the sum of products of two 
Catalan numbers, and the total number of A;-negative paths of length 2n that start with 
a down-step is 

k k 

A/down — ^ ] Cq-\C n - q = ^ ] C n - q Cq-\. 
q=l q=l 

Therefore the total number of A;-negative paths of length 2n is equal to 



(1) Af — Af up + A/down — V Cp-lCn-p + V Cn-qC q -l, 



n—k k 

yi c p-i c n-p + ^2 

p=l q=l 



that is, 

A/" = (CoC n _i + C\C n -2 + • • • + C n -k-iCk) + (C n -iCo + C n -2C\ + • • • + C n -kCk-i) 
Reversing the order of the summands in the second parentheses clearly gives 
M = (CoC n _i + • • • + C n __fc_iCfc) + (C n _fcCfc_i ■ • • + C„_iC ) 



n-1 



— CiC n - 
i=0 

Now, for any Dyck path of length 2n we can look at the first nonempty prime path. It 
will have length 2% + 2 for some % > 0, giving a total of Cj such prime paths. Since the 
total length of the entire path is 2n, we must have i < n — 1. Therefore, since the rest of 
the path is an arbitrary Dyck path of length 2n — 2i — 2 there are C„_j_i possible paths 
that can follow the initial prime path of length 2i + 2. Therefore, the total number of 
Dyck paths of length 2n that start with a prime path of length 2i + 2 is given by C;C„_i_i. 
Therefore if we sum over < i < n - 1 we will get all possible Dyck paths of length 2n, 
which is given by C n . Therefore 

n-1 

A/" = ^ ] CjC n _j_i = C n , 

i=0 

and so the total number of A;- negative paths of length 2n is equal to C n . □ 
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3. A Bijective Proof 
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Let Sk denote the set of all /c-negative paths of length 2n. Choose some k, such that 
< k < n. 

For s G Sk, find the last positive prime Dyck path q. Next factor s into s = pqr, where 
p is the path up to the last positive prime Dyck path and r is the rest of the path after 
q. Since q is the last positive prime Dyck path, the remainder of the path r must be a 
negative Dyck path. This factorization is unique. 

Like any other prime Dyck path, q is composed of an up-step followed by an arbitrary 
Dyck path Q and a down-step. So q can be rewritten as q = uQd, where u denotes an 
up-step and d denotes a down-step. Define the function tp + : Sk — > Sk+i by 

(2) f+( s ) — <£+(puQdr) = pdruQ, for anys G Sk- 

Given an s G Sk, <p+{s) = pdruQ by (J2J). Since p begins and ends on the the x-axis, 
the step d is below the x-axis. As noted earlier r is an arbitrary negative Dyck path, so 
in the path ip + (s), r starts and ends at height —1, without going above it. Therefore the 
step u is negative, starting at height —1 and ends on the x-axis. Since Q is a Dyck path 
it contains no negative steps, so altogether (p+(s) has 2 steps below the x-axis in addition 
to the number of negative steps in path segments p and r. Since the only negative steps 
in the path s G Sk occur in p and r, the number of negative steps in path segments p and 
r is equal to 2k. Thus (p+(s) has 2k + 2 steps below the x-axis and so is a member of 
Sk+i- 

Now for any a G Sk+i we can find the last negative prime path to. Thus we can write 
cr = mug, where tc is the path up to the last negative prime, and g is the rest of the path 
which is by construction positive. So since a; is a negative prime path it can be written as 
dflu, where Q is an arbitrary negative Dyck path. Therefore any a G S k +i can be uniquely 
written as a = ndQug, where Q is a negative path and g is a positive path. Therefore, if 
si = p\uQ\dr x ^ s 2 = p 2 uQ 2 dr 2 G S k then either Pi ^ p 2 , Qi ^ Q 2 or n ^ r 2 , so clearly 
since <p+(si) = pidriuQi, tp + (s 2 ) = p 2 dr 2 uQ 2 , since the factorization in Sk+i is unique, 
(f+(si) 7^ (p + (s 2 ). Therefore, tp + is injective. 

Using the factorization on S k +i we can define the injection tp_ : Sk+i — > Sk which sends 
cr = ndQug ^ nugdfl. Since there are now two fewer negative steps in f-(cr) than in 
cr itself, due to the fact that the up-step now comes before the down-step, <f-(u) G Sk- 
Additionally, since there is an unique decomposition in Sk, <p~ is injective. Thus, by the 
Scroder-Bernstein theorem \Sk\ — \Sk+i\. In fact, tp_ = cp^ 1 , since if-(pdruQ) = puQdr 
for all s = puQdr G Sk, so ip + is in fact a bijection. 

Therefore, we can now bijectively prove the Chung-Feller theorem. 

Bijective Proof. The cardinality of Sk and Sk+i are equal, for all nonnegative k < n — 1. 
Since there are n + 1 equal sets of paths of length 2n each with the same cardinality, and 
the total number of paths of length 2n is ( 2 ^) , we have the equation 

(n + 1)JC = ^ , 

where K, is the number of fc-negative paths of length 2n for any k such that < k < n. 
Therefore K = 4r( 2n ) = C n . □ 

n+l V n I n 
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4. A Generating Function Approach to the Theorem 



In general a generating function is a very useful tool used in enumerative combinatorics. 
The generating function of an infinite sequence {a n } can be thought of as the function 
for which the coefficient of x n in the power series expansion about x = 0, is a n . That is, 
if A(x) is the generating function for the sequence {a n }, then A(x) = Y^=o a nX n ■ 

The generating function can also be obtained more formally by weighting certain com- 
binatorial objects by a variable x. For instance, if {a n } represents the number of bloops 
with n glurps, then A(x) is obtained by weighting each glurp by x and summing over all 
bloops. We will use both formulations in this section to give a straight-forward proof of 
the Chung-Feller theorem. 

The generating function for the Catalan numbers is given by 

(3) c{x)=Y^C n x n = 

n=0 

which is obtained by weighting every step in a Dyck path by y/x and summing over all 
possible Dyck paths. 

Let us construct a generating function for arbitrary paths that end on the x-axis by 
weighting each step by y/x and each step below the x-axis by \/tx, such that a path with 
2n total steps, 2k of which are below the x-axis is given the weight t k x n . As described in 
section |21 the primes of a k- negative path are either positive prime Dyck paths or negative 
prime Dyck paths. Let P n denote the number of (positive) prime Dyck paths of length 
2n. 

Such a path of length 2n consists of an arbitrary Dyck path of length 2n — 2 sandwiched 
between an up-step and down-step. So the number of prime Dyck paths of length 2n is 
given by the (n — l)th Catalan number, that is P n = C n _i. Therefore the generating 
function for the prime Dyck paths is given by 

oo oo oo 

(4) p + (x) = ^ P nX n = ^ C n-lX n = X ^ C n X n = Xc(x) . 

71=1 71=1 71=0 

Similarly, for negative prime Dyck paths the generating function, now weighted by \/xi 
because each step in a negative prime Dyck path is below the x-axis is given by 

oo oo 

(5) P-(x,t) = ^2P n (tx) n = tx^2C n (tx) n = txc(tx). 

71=1 71 = 

So since arbitrary paths can be factored into I primes (either positive or negative) for 
some I > 0, the generating function for such paths is 



oo . 

f-f 1 - (p_ + p^ 



In this generating function the coefficient of t l x^ is the number paths of total length 2j 
and with 2i steps below the x-axis. 
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By the definition of c(x) and equations (jlj) and © we get that 

N(t,x) = -. — r- ^— 

1 — xc(x) — txc[tx) 

1 



, 1 - y/1 - Ax 1 - VI -Atx, 

fa; h tx ) 

v 2x 2tx ' 



2 



VI - Ax + VI - 4te' 
Rationalizing the denominator we see that 



yi - ax - yi - Atx 

(6) = 2^3T) 



1 - y/l-Ax- 1 + VI - Axt 
2x(l - 1) 

i (\ - yi -Ax i - yi - Axt 



1 - 1 \ 2x 2x 
1 



1 - t 



(c(x) — tc(tx)) 



1 / oo oo x 



1 -t . 

K n=0 n=0 

00 1 _ +n+l 

1 (J x n 

t "' 

n=0 



1 - ' " 



Using N(t, x) and a simple algebraic identity we can prove the Chung-Feller Theorem. 
Proof. Because 

1 - t n+1 n 

— ~ = 1 + t + t 2 + ■ ■ ■ + t U , 

equation (0) can be rewritten as 

oo / n \ 

(7) N(t,x) = J2 C - E* fe 

n=0 \fc=0 / 

Thus the coefficient of in N(t, x) is equal to C n for < k < n. Therefore the number 
of A;-negative paths of length 2n is equal to C n for < k < n. □ 

5. A Proof by Reordering 

The Cycle Lemma of Dvoretzky and Motzkin [2] is intricately linked with our main 
theorem. In the author provides the following proof of the Cycle Lemma using an 
ordering which naturally reveals the equidistribution of the Chung-Feller Theorem. 

Lemma 1 (Cycle Lemma). Given a sequence n = aia 2 a^ - ■ ■ a n of integers with a« < 1 
such that Yl T j=i cii = k > 0, there are exactly k values of i such that all partial sums of the 
cyclic permutation 7Tj = a i+1 a i+2 ■ • • a 2n +k a 'i • • • cti are positive. 
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This is a stronger claim than we need, so let us just restrict ourselves to G { — 1, 1}. 
For ease of computation for any p < n, let s(p) = X]j=i a «- Given a sequence ir as above, 
let us define a new order relation < on {0, 1, ... , n}. For any p, q G {0, 1, ... , n}, 

p<q, 

if s(p) < s(q) or if s(p) = s(q) and p > q. 

Additionally, let us define rrii for i — 0, 1, . . . , n such that rrii G {0, 1, . . . , n} and there 
are exactly i elements m G {0, 1, . . . , n} such that m < rrij. 

Now, let if we look at the jth cyclic shift of 7r, denoted 7Tj = aj + \aj +2 ■ ■ ■ a n a\ ■ • -cij, 
then the partial sums of iTj, s^(p), is given by s^(p) = Y7i={ a i+j — a i+i + a «+2 + • • • + a p , 
where indices are considered modulo n. This leads to the equation 



(8) si(p) 



s(p)~s(j), tfj<p<n; 
s(p) — s(j) + k, if < p < j. 



Proposition. For any sequence tt with steps a.i G { — 1, 1} and sum k = 1, the rriith cyclic 
shift of it has exactly i + 1 values of p such that s mi {p) < 0. 

Proof. Clearly, s mi (m,j) = 0, so there is at least one such value of p. If / < i, then let us 
check s mi (mi). If s(mi) = s(m.j) then mi > mi, so we know that s mi (nii) = s(m^) — s(mj) = 
0. If, however, s(rrii) < s(rrii) then s mi {mi) < s(m;) + 1 — s(mj) < 1, so s mi {mi) < 0. 

If / > i, then either s(rrii) > s{mi) or mi < mi and s(rrii) = s{rrii). In the first 
case since s mi (rrii) > s(mi) — s(rrii), clearly s mi (rrii) > 0. Otherwise, if mi < mi then 
s mi (mi) = 1 + s{mi) — s(mi) so since s(mi) = s(mi), s mi (mi) = 1. 

Thus there are exactly i+1 values of p, namely mo, mi, . . . , mi, such that s mi (p) < 0. □ 

The proof of the Cycle Lemma naturally follows from this proposition. 

Proof of Cycle Lemma. Use the order relation < to calculate for each % = 0,1, ... ,n 
and let = 7i mo . By definition, only s m °(mo) = 0. Since s m °{p) = s<f,(p — m ), only 
s (j,(0) < 0, so all partial sums of for p > are positive. □ 

Let n represent a Dyck path from (0, 0) to (2n + 1,1), by taking the path (1, a^). Let 
us restrict the ordering < to just the cyclic shifts of n that begin with an up-step. Since 
the order structure of {0, 1, . . . , 2n + 1} is maintained we can order the up-steps jk = rrii 
is the up-step with exactly k up-steps j such that j <jk for k — 0, 1, 2, . . . , n. Therefore, 
though there will be a total of i vertices on or below the rr-axis in the n^th cyclic shift of 
7r, there will be precisely k up-steps that start on or below the rr-axis. 

Since there are n + 1 total up-steps, there are n + 1 cyclic shifts of tt that start with 
an up-step. Now since there is precisely one cyclic shift for every path, then we have that 
the total number of paths with steps a« G {—1,1} and sum 1 that start with an up-step 
and have exactly k up-steps that start on or below the x-axis is given by the fraction ^j-j- 
of the total number of paths from (0, 0) to (2n +1,1) that start with an up-step. Thus it 
is the same as the number of paths from (1, 1) to (2n + 1, 1) which is precisely the number 
of paths from (0,0) to (2n,0); that is, ( 2 ™). 

Therefore the number of paths from (0, 0) to (2n +1,1) that start with an up-step and 
have exactly k up-steps that start on or below the x-axis, for k — 0, 1, . . . , n, is given by 
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^+t( 2 «) = C n - If we drop the initial up-step then we are left with a path from (0,0) to 
(2n, 0) with exactly k up-steps below the x-axis. So there are C n paths from (0, 0) to 
(2n, 0) with exactly k up-steps (and thus a total of 2k steps) below the x-axis for each k. 
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